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QUASI-LINE AR THEORY VIA THE CUMULANT EXPANSION APPROACH 

Frank C. Jones 
Thomas J. Birmingham 
Theoretical Studies Group 

ABSTRACT 

We use the cumulant expansion technique of Kubo to derive an integro- 
differential equation for <f>, the average one particle distribution function for 
particles being accelerated by electric and magnetic fluctuations of a general 
nature. For a very restricted class of fluctuations, the <f> equation degenerates 
exactly to a differential equation of Fokker-Planck type. Quasi-linear theory, 
including the adiabatic assumption, is an exact theory for this limited class of 
fluctuations. For more physically realistic fluctuations, however, quasi-linear 
theory is at best approximate. The nature of the approximation is seen by 
explicitly comparing the usual quasi-linear diffusion coefficient D 0 (which is 
second order in the fluctuations) to the fourth order contribution D 2 . Quasi- 
linear theory is asymptotic in the ratio of the wave-particle coherence time t c 
to the fluctuation induced deviation time r d or deflection time t d . Since r c 
varies over phase space, quasi-linear theory may be valid for some regions of 
phase space and invalid for others. 
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QUASI-LIN EAR THEORY VIA THE CUMULANT EXPANSION APPROACH 


1. INTRODUCTION 

Jokipii and Lerche (1973) have recently studied the statistical behavior of 
charged particles in a random magnetic field of one special type. They find that 
a differential equation of Fokker-Planck type exactly describes the evolution of 
the ensemble average one particle distribution function <f> . They compare 
the exact equation with the equation obtained by making the quasi-linear approxi- 
mation. The equations are identical provided that the adiabatic approximation 
is also made on the equation derived via quasi-linear theory. 

This Jokipii and Lerche work shows that for their special model, at least, 
quasi-linear theory is a very good approximation. Is the Jokipii-Lerche result 
a consequence of P air model or is it considerably more general? 

We answer this question by first deriving in Section 2, from the stochastic 
Vlasov Equation the general integro -differential equation which <f> obeys for 
arbitrarily varying (in space and time) electromagnetic fluctuations. We do so 
by using the cumulant expansion technique of Kubo (1962, 1963). We then de- 
termine in Section 3 the conditions which must be imposed on the fluctuations 
in order that <f > exactly satisfy a Fokker-Planck or diffusion equation. The 
conditions are so restrictive that no physically realistic model of fluctuations is 
included. 


1 



In Section 4 we examine the quasi -linear approximation (Drummond and 
Pines, 1962, 1964; Vedenov et al. , 1961, 1962; Bernstein and Engelmann, 1966; 
Aamodt and Drummond, 1964; Frieman and Rutherford, 1964; Rogister and 
Oberman, 1969; Kennel and Engelmann, 1966; Galeev and Karpman, 1963; 

Galeev et al. , 1965; Hall and Sturrock, 1967) to the general <f> equation in the 
context of the cumulant expansion. The quasi-linear approximation is shown to 
be a short coherence time approximation. The ratio of the neglected terms to 
the quasi-linear terms depends on the ratio of the coherence time t c to the de- 
viation time r d or the deflection time t d , whichever is shorter. The coherence 
time is the time required for the random component of the force on a ■ ven 
particle to become uncorrelated; the deviation time is the time required for the 
random force to significantly alter the particle's position; and the deflection 
time is the time for this same force to significantly alter the particle's velocity. 
If the condition r c < T d* T D P revai l s » quasi-linear theory describes the evolution 
of <f> very well. When t c is greater than either r d or t d , however, we do 
not expect quasi-linear theory to be at all accurate. 


2 



2. STOCHASTIC VLASOV EQUATION 


We consider the one particle distribution function f(t,x,v) that satisfies the 
Vlasov Equation for particles which interact with a force F. 


where 


+ <L 0 + L,)f = 0 


a a 

L ° sv -^ + V aT 


(I) 


is the non-stochastic Vlasov operator and L, = F t • 3/3v contains the stochastic 
part of the force (in general, <Lj> # 0). We assume that L 0 is constant in 
time although L t can, in general, be time dependent. 


Transforming to the interaction representation we have (Kubo, 1963) 


at 


+ ft(t) g = o 


( 2 ) 


where 


and 


g(t) = e L °‘f(t) 


ft(t) = e L ° t L 1 (t)e" L ° t . 


Equation 2 may be formally integrated by means of the time ordered expo- 
nential operator 


g(t) = T exp 




g(0). 


(3) 
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In Equation 3 the time ordering operator T orders the SI operators denoted by 
the exponential such that the operator at the earliest time operates first, the 
next earliest second and so on, i. e. 



+ (4) 

As long as the T operator stands to the left of an operator expression the oper- 
ators to its right may be treated as if they were commuting variables or c num- 
bers in any algebraic manipulations. This is so because the T operator contains 
and preserves the information about how the operators are to be ordered after 
the manipulations have been completed. 


Equation 3 may new be averaged over an ensemble of L, 's to obtain 


<g>(t) = 



(5) 


where we I. ? chosen g Q = g(o) to be uniform over the ensemble. It has been 
shown by Kubo (1962) that the averaged operator in Equation 5 may be written 
as a general cumulant expansion 



( 6 ) 
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{£ ir / f d, 2 •/ <it„<w,)n(t 2 )— si<t„)>A 

In = 1 ■'o *'o J 0 J 


= T exp 


where the brackets <> c refer to the cumulant average, i.e. 


(6 Cont’d) 


<A> C = <A>;<AB> C = <AB> - <A> <B> etc. 


(Kubo, 1962). 


The time ordering operator T applies to everything to its right including 
the expansions of the cumulant average < > c in terms of the conventional 
averages < >. For the cumulants with n > 3 the T operator notation becomes 
essential when expressing cumulant averages in terms of conventional averages. 
This can be seen as follows 

<ABC> = < ABO - <AXBC> - <BXAC> 

- <ABXC> + 2<AXBXC> . 

One will note that in order to express all of the required correlated averages 
using the conventional notation it was impossible to maintain the ordering, ABC . 
in all of the terms. An operator such as T standing to the left that tells in what 
order the various operators are to be applied removes this notational difficulty. 


If we introduce the notation 




dt n <n(t 1 )Jl(t 2 )n(t 3 ) - - - ft(t n )> c 


we may write 


<g>(t) = T ex 



H) n yt) k 


■} 


(7) 
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In an attempt to find a kinetic equation for <g> we differentiate Equation 7 with 
respect to time, noting that the time derivative commutes with T, to obtain 



This appears to be as far as we can go maintaining full generality. We must 
now examine what exact conditions the statistical operators ft(t) must satisfy 
in order that Equation 8 be equivalent to a Fokker-Planck equation or what ap- 
proximate conditions they must satisfy in order that Equation 8 may be approxi- 
mated by a Fokker-Planck equation. 
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3. CONDITIONS FOR EXACT KINETIC EQUATIONS TO EXIST 

We note first of all that the operator 3K n (t)/3t is an n th order differential 
operator. If we wish to obtain a kinetic equation of finite order the series in 
square brackets in Equation 8 must terminate at some finite n in general and at 
n = 2 in particular if we wish a diffusion or Fokker-Planck type equation. 


The only way to obtain this termination for all values of t is to require that 
all cumulant averages of the statistical operc.cor S2(t) higher than the second 
vanish identically. This is equivalent to requiring Jl(t) to be a Gaussian process. 
It thus appears that the first requirement that the operator fl(t) must satisfy is 
that it be a Gaussian process; in so far as cumulant averages of to powers 
higher than the second are non-vanishing the kinetic equation will deviate from a 
Fokker-Planck type equation by the inclusion of derivative operators of order 
greater than the second. 


If we now assume that fl(t) represents a Gaussian process Equation 8 becomes 


3<g> 

3t 





<I2(t)fl(r)> c dr 


X exp{-Kj + K 2 } g 0 . 


(9) 


Since t is the latest time in the expression the <ft(t)> in the square brackets 
is already time ordered and may thus be brought to the left of the T operator. 


7 



So doing and using Equation 7 gives 

t 

dr<S2(t)ft(T)> c exp{-Kj + K 2 } g 0 . (10) 

Since the variable r in the integral above ranges over all times we may not, in 
general, commute the <£2(t)£2(r)> c and T operations to obtain a kinetic equation 
for <g>. The effect of the T operator in Equation 10 is to mix the £2(r) operator 
with every term of the expansion of exp {-K t + K 2 } and keep us from having a 
clearly separate operator applied to <g>. 

The only circumstance under which Equation 10 becomes a purely differential 
equation for <g> is when the T operator is unnecessary. This occurs when the 
order in which the £2 operators are applied is unimportant. This can be so only 
if we have 

in(tj),«(t 2 )J = 0,t, * t 2 (11) 

the £2 operators must commute with themselves at different times. 

We have thus arrived at the second condition that must be met by the statis- 
tical operator £2(t). We Krill see immediately that these two conditions, Gaussie . 
statistics and commutivity at different times, are sufficient to guarantee that 
<f >, the ensemble averted distribution function, satisfies exactly a Fokker- 
Planck type kinetic equation. However, upon closer investigation the require- 
ment of commutiv’ty, though simply expressed in Equation 11, will turn out to 
impose stringent requirements on the physical systems. 
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If Equation 11 is satisfied we may disregard the T operator in Equation 10 
to obtain 

3<g>(t) f t 

+<n(t)Xg>(t) = J <Wmr)> c dr <g>(t) (12) 

*0 

Leaving the interaction representation, we may express Equation 12 in terms 
of <f> as 

+ L 0 +<L 1 (t)>\<f>(t) ' f <L 1 (t)e L o< r ' t) L 1 (r)e -L o (r - t) > c dr <f>(t) . 

V 1 0 (13) 

The operator Lj(t) represents the effect of the fluctuating force Fj at time 
t at a phase space pc.'nt x, v. The operator e L o (T ~ t> L 1 (r)e' L ° (T ” t) represents 
the effect of this force at time r and at the point in phase space where a particle 
at x, v at time t would have been at time r if it were undergoing the unperturbed 
motion generated by the operator L 0 . If the statistics of L, are stationary in 
time (which we s h a ll assume hereafter) and if there is a time r c such that L ,(t) 
and e L o (T ' t) L 1 (r)e“ L o (T " t) become uncorrelated for t-r > r c , the integral in 
Equation 13 becomes independent of t for t > t c and our kinetic equation becomes 
a Fokker-Planck equation with constant coefficients. Note that this loss of cor- 
relation can come about elthor through an intrinsic time variation in Lj (t) or 
through the pronagator e L ° (r-t> removing the particles a distance over which 
the fluctuating force is spatially uncorrelated or both. 
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Turning to the conditions that must be met by a system such that Equation 
13 is an exact kinetic equation for <f>, we can say little about the assumption 
of Gaussian statistics. It is a common assumption, partly because the central 
limit theorem (Feller, 1968) states that a tendency towards Gaussian statistics 
is a common occurrence, but mainly because of its mathematical tractability 
which most other types of statistics do not share. The requirement that ) 
and S*t 2 ) commute for t, # t 2 , on the other hand, has some rather severe 
implications. 


One way of fulfilling this requirement is to demand ft(tj ) = ft(t 2 ) for alltj , 
t 2 . This implies 


d& 

3t 


^LLeVuV^AWLL,- 1 .' = 0. 

° 1 0 t 1 o 


(14) 


Multiplying from the left with e -1 " 0 * and from the right with e L o t yields 

3L, 

— 1 + IVLJ = 0 (14') 

dt ° 1 


or in other words L, is a constant of the motion generated by L Q . 


A physical model of such a system could probably be constructed, but one 
suspects that such a model would be rather artificial. Also note that in this 
case the integral in Equation 12 or Equation 13 will grow linearly in time so 
that a time-divergirg diffusion coefficient results. 
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Another way of fulfilling the commutivity requirement is to have 


[L 0 , 1^(01 = 9 for all t 


( 15 ) 


and 


[L 1 (t 1 ),L 1 (t 2 )1 =0 foralltj,t 2 . (16) 

Most models discussed to date are of this type. Expressing L 0 and Lj in con- 
ventional variables L Q = v • d/dx + F Q • 3/3v and L t = Fj • d/3v we see 
that Equations 15 and 16 imply 


/ 3F. 3F, 3F \ 3 3 

( v + F 0 - Fj J F. — = 0 

\ 3x 3v 3v / 3v 3x 

/ 3F,(t 2 ) aF,(t,)\ 3 

( F ' <t ,) 4 ~ ir - ' f,<,2> - ir ~) • * * 0 • 


( 15 ') 

(16') 


The above expressions are quite complex and no general solutions have been 
found but one can make some general statements abou' them. First of all the 
coefficients of the velocity derivatives must vanish. One could merely require 
that the distribution function have no velocity gradients in these directions but 
distributions functions that are unbounded in velocity space are extremely un- 
physical. Fortunately this is not quite so true for spatially unbounded systems 
or we would be forced to require F, = 0 and the entire problem would evaporate. 
One can therefore require spatial homogeneity to the extent that is needed to 
keep F, • d/dx = 0. This homogeneity will be more or less isotropic depending 
on to what extent, if any, F, is constrained in direction. 
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The coefficient of the velocity gradient in Equation 15* can be considered as 


a partial differential equation for a general velocity and space dependent force. 
However, if we limit our discussion to electromagnetic forces the velocity de- 
pendence is specified and terms of different orders in v must be set equal to 
zero separately. This gives 

E 0 XB, = E,XB 0 (17) 

v.(VE x +B,B 0 -B 0 B 1 ) = 0 (18) 

v • VBj Xv = 0 (19) 

Since the velocity is a free parameter and generally not constrained in magnitude 
or direction the coefficients of v must vanish order by order. Thus 


VEj +BjB 0 -B 0 B, = 0 (18') 

VBj = 0 (19') 

Similarly from Equation 16’ , it follows that 

EjttjlXBjUj) = E,(t 2 ) X B,(t,) (20) 

Bj(t,) X B,(tj) = 0 (21) 


We may add the full set of Maxwell's Equations to Equations 17, 18', 19', 
20 and 21 above and ask what arrangement of fields are then allowed. It is 
straightforward to verify the following: 
a. B, must be parallel to B 0 . 
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b. Bj must be spatially homogeneous and constant in time; 

c. E, must be spatially homogeneous and only the component of E t that is 
parallel to B 0 may vary in time. E } thus represents an infinite wave- 
length plasma oscillation in which the plasma current exactly cancels the 
displacement current. 


This case of one dimensional plasma oscillations was treated by Lerche 
(1972), who assumed Gaussian statistics for the fluctuating electric field and 
derived an exact equation of the form of Equation 13. For this completely 
spatially homogeneous situation L 0 = 0, Lj = (q Ej(t)/m) • 3/dv, <1 *> = 0 
and from Equation 13 


a<f> 

at 




dr<E,(t)E 1 (T) > 


d 2 <f> 

3v 2 


( 22 ) 


where <Ej(t) Ej(t)> is here the correlation function for the fluctuating electric 
field. 


Lerche has compared his exact result, his analogue of Equation 22, with the 
time asymptotic diffusion equation derived by quasi-linear theory. He finds 
that the diffusion coefficients in the two equations differ by a facte* h a re- 
sult in contradiction with the later' conclusions of Joldpii and Lerche (1973) and 
with our own findings in the next section. We have found in Lerche’ s original 
work an algebraic error which accounts for the discrepancy. 
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While this elementary system is the only one in which both the commutivity 
of f2(tj) and fi(t 2 ) and Maxwell's Equations are rigorously satisfied, further 
latitude is available if we require the commutivity of Sl(tj) and Jl(t 2 ) by fields 
which only approximately satisfy Maxwell’s Equations. The model considered 
by Jokipii and Lerche (1973) falls into this latter category. 

Jokipii and Lerche consider a situation in which the only superposed electro- 
magnetic field is a time fluctuating, spatially homogeneous magnetic component 
Bj(t) (E Q = Ej = B q = 0). The induced electric field required by Maxwell's 
Equations V X Ej = c -1 3Bj /3t is assumed small and negligible, although it is 
clea T . as the authors admit, that E } can not be small everywhere in the infini te 
space. With the additional assumption that spatial variations be only along so 
that V = & z d/dz, L Q = v z d/dz, = coj(t) 3/80, and Equations 15' and 16’ are 
satisfied. Here cjj is the cyclotron frequency qBj/mc and 0 = tan -1 v y /v x is 
the gyrophase. In the main body of their paper, Jokipii and Lerche take Gaussian 
statistics for B } , assume <Bj> = 0, and come up with the identical result 

3<f> 3<f> f* d 2 <f> 

— +v z — = J dT<co,(t)a>j(r)> c — (23) 

0 

vhich we obtain from Equation 13. 

In their appendix Jokipii and Lerche take a different statistical model for 
B ] . Here B } has the same initial value b Q in each realization of the ensemble 
but otherwise vanes randomly such that the probability distribution P(B, ,t) 
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at a later time satisfies an Ornstein-Uhlenbeck (Uhlenbeck and Ornstein, 1930) 
equation 


3P 3 

— = (B.P) + <B, 2 > 


3t 3B 


l 


d 2 P 

3Bj 2 


(24) 


Their assertion is that the statistics of B, in this case need not be Gaussian. 


However, a quantity whose probability distribution satisfies Equation 24 is 
a Gaussian process; in fact it is a Markovian, Gaussian process (Doob, 1942) 
so no violation of our basic conditions is actually found here. 


If Bj(t) is an Ornstein-Uhlenbeck process with initial value b Q it is a 


J B,(r)dr 


Gaussian process with <Bj(t)> = b 0 exp-t/r c - The time integral 
is also a Gaussian process; Uhlenbeck and Ornstein (1930) give expressions for 
the mean and mean square of this integral. Recalling that the diffusion coefficient 
is one half the time derivative of the second cumulant average 
we readily obtain 


U1UUD1UU VA/C111V/ 

r 

of I Bj(T)dT 

■o 



0 — t/r d \ 


(25) 


3 2 

= <B, 2 (t->°o)> r c (l + e~ 2t/r c - 2e" t/T c)— <f> (t) . 

00 1 

Equation 25 differs from that of Jokipii and Lerche (1973) by the numerical coef- 
ficients on the terms of the diffusion coefficients and the algebraic sign of the 
first order 0-derivative term. We attribute this difference to algebraic errors 
on their part. Our diffusion coefficient reduces to zero at t = 0, as it should. 
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while theirs does not. Also, only the minus sign on the 3/3 0 term is consistent 
with a clockwise gyration of electrons as viewed in the direction of B t . 

We see that the cumulant expansion method very quickly yields kinetic 
equations when they are exactly derivable, and it further makes quite explicit 
the rather stringent requirements that must be fulfilled in these cases. We 
shall now see how the quasi-linear approximation fits into such a scheme and 
estimate the order of magnitude of the error that is involved. 
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4. APPROXIMATE KINETIC EQUATIONS 


If we return to Equation 10 and insert the appropriate expressions for 
and K 2 we may write it as 


d<g> 

dt 


+ <fl(t)Xg> = T 



<f2(t)J2(r)> c dr 


X exp 





where 0(rj )is the operator 


00 ') 


0(t x ) = )> - f 1 dr 2 <S2(r 1 )S2(r 2 )> c . 

J 0 


We note once again that it is the mixing of the operator S2(r) with the exponential 
as required by the operator T that prevents us from immediately writing down an 
exact kinetic equation for <g> and hence for <f>. All of the difficulty arises 
from that region of the integration over where t > t x > t for when . > t x the 
operator £2(r) is already properly ordered and T may be brought through to act 
directly on the exponential operator. 


If it can be shown that the region of the t x integration where t > r y > t is 
relatively unimportant an approximation may be based on this fact. Such a situa- 
tion may arise if it can be shown that the operator < £2(t) £2(r) > c goes to zero if 
r is sufficiently different from t. If a ’’correlation time” r c exists such that 
<Jl(t) J2 (t)> c -*■ 0 for t - r > t c and if t > r c the significant contribution to 
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the t integral will come from those values of r within a few correlation times of 
t. Hence only for t > Tj > t - Nr c , N = 0(1) will there be a problem with the 
T operator and this region of r x , being small compared to t - Nr c > r x > 0, 
may perhaps be neglected. 

The effect of the region t > rj > t - Nr c may be neglected in at least two 
different ways. The first would be to neglect the effect of the T operator in this 
region and move it through to the exponential operator as un approximation. This 
approach, of course, leads to Equation 13 but this time as an approximation 
rather than an exact equation. The difficulty with this approach is that it is vir- 
tually impossible to tell just what has been neglected and thus estimate the va- 
lidity of the approximation. 


The second method does not suffer from this defect. Note first of all that 
since the T operator gives the complete prescription as to how the SI operators 
are to be ordered in any expansion we may treat them as c-numbers as long as 


they are to the right of T. We therefore may write 




dTiOfrj) 
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The approximation consists of neglecting the second term of the right hand side 
of Equation 26 and keeping only the first term. 

Since $2 (r) and the retained term of Equation 26 are now properly time or- 
dered we may bring the T operation through to the exponential operator to obtain 
after some algebra. 

(~ + L q + <Lj (t)>) <f> (t) = / <L 1 (t)e L o< T - t >L 1 (T» c <f>(r)dT. (27) 


It can be readily seen that we have arrived at the standard quasi-linear 
approximation to the kinetic equation for <f>. The term that we have neglected 


T 



(28) 


may be expanded in a systematic manner by first expanding the exponential in 
square brackets in a power series and then breaking up the second exponential 
into two regions of t 2 as was dons before. This process may be continued in- 
definitely to produce the standard series although this method of generating it is 
more difficult than is the standard approach. We shall return shortly to a more 
detailed analysis of this remainder term. 


It is interesting to note that if starting with Equation 27 we make the addi- 
tional approximation usually referred to as the "adiabatic hypothesis", namely 
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replace <f>(r) by <f>(t) and shield it from the free streaming propagator 
e L ° (r ” ^ by placing e _L ° (r in front of it, one obtains Equation 13 that we 
previously obtained directly by simply neglecting the effect of the T operator in 
the region > t . We see therefore that the "adiabatic hypothesis" is on an 
equal footing with the quasi-linear approximation itself and it would be inconsis- 
tent with the spirit of the scheme to accept one and reject the other. In all like- 
lihood the two approximations stand or fall together. 

We must now address ourselves to the question of the validity of our neglect 
of the term given by expression 28. We have assumed that a correlation time r c 
exists such that the r integral cuts off for t - r > r c . This implies that the 
integral in the exponent I dr, 0(t { ) will extend over at most a few times t c and 

T 

the hope is that this contribution will be in some sense small. Since 0(j x ) con- 
tain derivative operators that operate on the distribution function as well as the 
force field function the operator can not be bounded in any rigorous sense. To 
proceed we must therefore make some assumptions about the distribution func- 
tion. We shall therefore assume that the gradients in phase space of the distri- 
bution function are negligible compared to the gradients of the random force 
function. This is well within the spirit of diffusion theory where one expects to 
be able to describe the evaluation of a distribution function only on time and length 
scales that are large compared to the correlation scale of the random forces. 



On making this assumption one can now write the right hand side of Equation 
10 as a diffusion operator 


where D 0 is the quasi- linear diffusion coefficient and Dj , D 2 etc, are correction 
terms generated by the quasi-linear series. For the sake of simplicity we shall 
set <n>= 0 in the following and compare the terms D 0 and D 2 (Dj is now = 0). 


Before continuing we will need to introduce some additional notation. We 
shall employ the six-dimensional phase space position vector z = (x, v) and a 
six dimensional force vector f g = (v, F 0 ) » Fj = (0, Fj ) . With this notation we 
have 


L 


o 



+ F„ 


_ 3 _ 

dv 


and 

d 9 

Furthermore since we will need scalar products of quantities that may be sepa- 
rated from each other in an expression we will occasionally make use of a com- 
ponent notation with summation of repeated indices implied, i.e. , 

F ° ' 9z = Foi 9z[ = S Foi 9zj * 

1 i=l 1 

Recalling that setting <ft> = 0 makes 0(r x ) = - f dr 2 <fi(r 1 )S2(r 2 )> 
we may turn to Equation 28 and expand the exponential in square brackets keeping 
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only the lowest order term to obtain the approximate expression for the neglected 
term. 



We see that for the range 0 < r 2 < r theT operator once again mixes Si(r 2 ) with 
the exponential. We may treat this problem as we did before by neglecting the 
region between r 2 - r in the exponent integral whenever r 2 < r . 


We then have two terms 
•t yt 


T f dr f d Tl <SXt)Q(r)> f f dr 2 <ft(r 1 )ft(r 2 )> 
U T l J 0 


X exp u: dr 3 0(T 3 •f dr 2 <n(r,)n(r 2 );' 

X exp 

= T f dr f dr l <Sl(t)Sl(r)> f f dr 2 <fi(T,)n(r 2 )><g> (r 2 ) 
*0 r L*t) 


( 29 ) 


j 1 dr 2 <fl(r 1 )ft(r 2 )><g>(r)| . 


We may now relabel the variables In the first term by interchanging the 
labels t and r 2 and interchange the order of integration to reinstate the original 
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order to obtain 


- f • f f dr 2 <^<S2(t)S2(T)><£2(T )£2(t 2 )> 


•t yt 

"0 t 'r 

+ <n(t)ft(r 2 )><S2 (Tj )£2(t)>| ^g> (r) 


(30) 


T nserting the appropriate expressions for the £2 factor and converting from <g> 
to <f> we obtain for the second order diffusion coefficient D 2 the expression 

D 2 (t) = j dr j dr, j * dr 2 <4 1 (t)e‘ I 'o< t - r i>F 1 (T 1 ) - A 


e _L o^ T i _T 2^F, (r 2 ) • — e ‘ L ° (T 2' T) F (r)) 
‘ 1 dz 1 / 


(31) 


where the primed brackets means that in taking the average only two of the three 
possible correlations are included, i.e. , <t, Tj , r 2 , r>' = <t, r 2 > <Tj , t> 

+<t, t> <Tj , r 2 > where here the groupings indicate correlations only and not 
the ordering of the operator. 


Expression 31 as it stands is not very transparent and it would be difficult 
to estimate the relative magnitude of D 2 with respect to D Q from it in this form. 
We will therefore transform it to a more recognizable form so that it will be 
easier to visualize what physical parameters govern the size of D 2 as compared 
to D 0 . 

It will be useful to first review the algebraic properties of the Lie operator 
e" L o t and the characterictic solution z*(z, t) and z°(z,t) . z* and r° are 
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solutions of the respective differential equations 

(-^7 + L 0 + L, j z'izM = 0 

(it* L “) z ° (2 t) = ° 

with the coiomon initial value 


(32) 

(33) 


Z*(Z,o) = z°(z, o) = 2 


(34) 


It is readily seen that z* and z° are the phase space positions at a time -t 
of a particle which is moving in the force fields F c + F, and F 0 respectively and 
is at the point z at t = 0 . Equation 33 may be solved hy the Lie operator as 

z°(z, t) = e" L ° , z 

and hence the effect of this operator on the derivative operator is 


And cleaJ.y 


® 9 _L t ^ z i 9 i* 

e o* e ° = — L — e ~L 0 t 

dz ' °(t) 3z° dz i 


9 -l t 3z i 0) - L t 3 

e L o l = — 1 e ° 

dz 3z dz i 


(35) 


(36) 


It should be further noted that any function of z*(z°) is itself a solution of Equa- 
tion 32 (Equation 33). 


Equation 32 may be cast as an integral equation using the solution to 33, z° 
and the Lie operator as 


z*(t,r) * z°(t-r) 




, z *(f j,r) . 


(37) 


24 



This integral equation may be solved by iteration in a straightforward manner. 
The result may be expressed in terms of the deviation of the Vlasov trajectory z* 
from the free streaming trajectory z° as 

z*(t,r) - z°(t-r) = Az(t,r) = Az 1 + Az 2 + Az 3 + . . . 


where 


(38) 


Az'(t,T) = - J dT 1 e' L o^ t - T i)F 1 (r 1 ) - z°(r,-r) 

T 

Az 2 (t,r) = -f dr j e _L °^ t-T 1 Vj (Tj ) • ^-Az^Tj.t) 

J T 

= f dr 1 e- L ° (t_T i ) F 1 (r 1 ). ^ f dr^o^r^^) ■ £ z°(r 2 -r) 

V •'r 

T T (39) 

etc. We shall return to these equations shortly. If we now return to Equation 31 

we note that the expression can be broken into two terms, one proportional to 

dFj (r)/3z and the other proportional to d 2 Fj (r)/dz 2 . Adopting the convention 

that parentheses bound the range of the .Lie operators and the derivative operator 

and noting that 

(e" Lot ABC. . .) = (e^o* A) (e -L o t B) (e -1 ^ 1 C). . . 


we obtain 


D 2 (t) = / dT / dT i / * dT 2 (e‘ L ° (t ‘ Tl) Fi • ^ 2 -l o (t i- t 2) 

0 r r \ 


XF U (r 2 ) 


aZ °j (T 3- T) \ /-L 0 (t-r) 3F M\ ' 


(40) 




37 i 


25 




(40 Cont’d.) 


U slug the group property of the Lie operator, i. e. , 

e ' L o t i z O(t 2 ) = t ~ L o t i c ~ L o t iz = e“ L o^i +t 2^z = z^tj+tj) 
and Equation 35, we may move 3z° rn (r 2 -r)/3z- in the second term to the right 
through the operator e~ L ^ T i~ T2 ^ to obtain for the second term 


second term 


- * jf dr l jf ' *i <P|(« ^- L o<*-’l> Fll (r,) 82 



(41) 


(J i <>,->) (e- L “ < - T) 5S))>' ■ 


The last step was justified by the symmetry of the integral under the exchange 


r, **■ r 2 . 
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Since the integral over Tj and t 2 in the first parentheses oi Equation 40 is 
just equal to expression 39 for Az 2 (t,r) and the first two parentheses of expres- 
sion 41 are each equal to Az*(t,f), we may write D 0 and D 2 together as 


D 0 + D 


2 



(F^r)) 


Z = Z°(t-T) 


+ Az 2 L (t,r) 



Z = Z°(t— T ) 



/3 2 F,(t)\ l\ # 

(Az 1 j(t,r) Az 1 j(t,r))(— — - — ) \ * 

‘ W z j / t = z«(.-r) / 


(42) 


D 0 is evaluated by propagating Fj (t) along the free streaming orbits z°(t-r) ig- 
noring the effect of Fj on the orbit. Clearly D 2 is trying to correct this neglect 
to second order in Fj via a Taylor's series in the orbit corrections Az (t,r) . 

This picture is not strictly correct because of the prime in the averaging brackets 
meaning that one of the terms that would be present in the normal average is not 
present. 


D o + D 2 + D 4 + 


f\r 


<F j (z,t)F , (z*(t-r ),r)> 


(43) 


However, the correction terms do appear to be proportional to the trajectory cor- 
rections to the appropriate order such that if Az = 0 or if Fj were not dependent 
on z, D q would be exact. 
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5. DISCUSSION 


Our assertion in the Introduction that quasi-linear theory is a short coher- 
ence lime approximation now has clearer meaning. We see from Equation 42 
♦hat D 2 ^ D 0 provided that lAz • 3/3z Fj l rms ^ |F t l rms for the length of time 
tnat contributes significantly to the integrand. For the spatial components of 
z, lAz • 3/3z Fj l rms se |Fj l rn)S defines the deviation time r d , that interval 
over which the average particle’s actual position deviates by an amount the order 
of one correlation length x c from its zeroth order position. For the velocity 
components of z, |Az • 3/3z Fj l rms IFj l rms defines the deflection time r D , 
that interval over which the average Av becomes of the order of the unperturbed 
v . The cut-off time of the integrand is produced by the incoherence of F, (t) and 
Fj (r) for times t - r > r c . Thus D 2 « D 0 requires T d , r D > t q . 

For those restricted physical systems which satisfy Equations 17-19 and 
thus lead to an exact Fokker -Planck description of <f>, the fluctuating force 
Fj is independent of phase space position and thus both and ? D are infinite. 

For realistic physical systems r d and/or r D are finite and quasi-linear 
theory provides at best an approximate description of <f > . For any given 
system, however, quasi-linear theory may not be a uniformly adequate approxi- 
mation, since r c can vary over phase space and become arbitrarily long in 
certain regions. 
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Kaiser, Jones, and Birmingham (1973) pointed out this fact in their paper 
discussing diffusion in the presence of only a static, spatially fluctuating mag- 
netic field Q z 5B(x). (x, y, z are Cartesian coordinates here. ) Zeroth order 
orbits are x( t) = x(t) + v cos® (r - t), v = (v y 2 +v x 2 ) % = const, (where 
tan® = Vy/v x ) in this 2 dimensional model. For particles with no initial v x , 
i. e. , cos® = 0, Fj [x(t>] and Fj [x(t)] are correlated for an arbitrarily long 
time so tnat r c -*• «». Quasi-linear theory is useless for describing these particles. 

This same inadequacy of quasi-linear theory manifests itself in the problem 
of magnetically induced pitch angle scattering (Hasselman and Wibberenz, 1968; 
Kulsrud and Pearce, 1969; Jokipii, 1971, 1972;Klimas and Sandri, 1971, 1973), 
where the pitch angle 0 is defined with respect to a unif orm average background field 
B 0 ^ x . For this situation Am ^Sb/B c (M = cos®). Ax — 5B v t/B 0 and thus r d = 
B 0 x c / 5Bv. But t c = x c /mv and hence r c /r d = 5B/ mB 0 . If this ratio is to be 
smal., JB/ #iB 0 < 1: quasi-linear theory is applicable only to particles which 
have m > 5 B/B 0 (Jones, Birmingham, and Kaiser, 1973). 

The shortcomings of quasi-linear theory can be improved upon by incorpora- 
ting some effects of the fluctuations in the lowest order orbits z 0 (Dupree, 1966; 
Weinstock, 1968, 1969; Birmingham and Bornatici, 1971; Mikhailovskii, 1964; 
Al'tshul’ and Karpman, 1965; Orszag and Kraichnan, 1967; Aamodt, 1967; 

Gaieev, 1967; Ichimaru, 1970; Rudakov and Tsytovich, 1971; Volk et al. , 1973). 
The fluctuations thus assist in removing particles from correlated regions of 
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phase space. Jones, Kaiser, and Birmingham (1973) propose a partial averag- 
ing procedure and show that for the pitch angle scattering problem ordinary 
quasi-linear theory is modified by the replacement of n by (n 2 + 8B 2 /B 0 2 )*• 
The correlation time y^/v (m 2 + 5B 2 /B 0 2 ) w is thus finite for all regions of phase 
space. 
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